UNED Online Reputation Monitoring Team at RepLab 2013
نویسندگان
چکیده
This paper describes the UNED’s Online Reputation Monitoring Team participation at RepLab 2013 [3]. Several approaches were tested: first, an instance-based learning approach that uses Heterogeneity Based Ranking to combine seven different similarity measures was applied for all the subtasks. The filtering subtask was also tackled by automatically discovering filter keywords: those whose presence in a tweet reliably confirm (positive keywords) or discard (negative keywords) that the tweet refers to the company [16]. Different approaches have been submitted for the topic detection subtask: agglomerative clustering over wikified tweets, co-occurrence term clustering [10] and an LDA-based model that uses temporal information. Finally, the polarity subtask was tackled by following the approach presented in [14] to generate domain specific semantic graphs in order to automatically expand the general purpose lexicon SentiSense [9]. We next use the domain specific sub-lexicons to classify tweets according to their reputational polarity, following the emotional concept-based system for sentiment analysis presented in [8]. We corroborated that using entity-level training data improves the filtering step. Additionally, the proposed approaches to detect topics obtained the highest scores in the official evaluation, showing that they are promising directions to address the problem. In the reputational polarity task, our results suggest that a deeper analysis should be done in order to correctly identify the main differences between the Reputational Polarity task and traditional Sentiment Analysis tasks. A final remark is that the overall performance of a monitoring system in RepLab 2013 highly depends on the performance of the initial filtering step. ? This research was partially supported by the Spanish Ministry of Education (FPU grant nr AP2009-0507 and FPI grant nr BES-2011-044328), the Spanish Ministry of Science and Innovation (Holopedia Project, TIN2010-21128-C02), the Regional Government of Madrid and the ESF under MA2VICMR (S2009/TIC-1542) and the European Community’s FP7 Programme under grant agreement nr 288024 (LiMoSINe).
منابع مشابه
Overview of RepLab 2013: Evaluating Online Reputation Monitoring Systems
This paper summarizes the goals, organization, and results of the second RepLab competitive evaluation campaign for Online Reputation Management Systems (RepLab 2013). RepLab focused on the process of monitoring the reputation of companies and individuals, and asked participant systems to annotate different types of information on tweets containing the names of several companies: first tweets h...
متن کاملRepLab 2013: Comparative Evaluation of Online Reputation Monitoring Systems and Components
We summarize the goals, organization, and results of the second RepLab competitive evaluation campaign for Online Reputation Management systems (RepLab 2013). RepLab 2013 focuses on the process of monitoring the reputation of companies and individuals, and asks participating systems to annotate different types of information on tweets containing the names of several companies. First, tweets hav...
متن کاملOverview of RepLab 2012: Evaluating Online Reputation Management Systems
This paper summarizes the goals, organization and results of the first RepLab competitive evaluation campaign for Online Reputation Management Systems (RepLab 2012). RepLab focused on the reputation of companies, and asked participant systems to annotate different types of information on tweets containing the names of several companies. Two tasks were proposed: a profiling task, where tweets ha...
متن کاملEntity-based Filtering and Topic Detection for Online Reputation Monitoring in Twitter Damiano Spina Valenti Master in Languages and Information Systems, Uned Doctoral Programme in Intelligent Systems
Programa de Doctorado en Sistemas Inteligentes Escuela de Doctorado de la UNED Doctor of Philosophy in Computer Science Entity-Based Filtering and Topic Detection for Online Reputation Monitoring in Twitter by Damiano Spina Valenti With the rise of social media channels such as Twitter —the most popular microblogging service— the control of what is said about entities —companies, people or prod...
متن کاملLIA@RepLab 2013
In this paper, we present the participation of the Computer Science Laboratory of Avignon (LIA) to RepLab 2013 edition. RepLab is an evaluation campaign for Online Reputation Management Systems. LIA has produced a important number of experiments for every tasks of the campaign: filtering, topic priority detection, Polarity for Reputation and topic detection. Our approaches rely on a large varie...
متن کامل